Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 7697 |
| Missing cells | 13107 |
| Missing cells (%) | 7.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 699.2 KiB |
| Average record size in memory | 93.0 B |
Variable types
| Numeric | 17 |
|---|---|
| Categorical | 6 |
df_index is highly correlated with SampleID | High correlation |
SampleID is highly correlated with df_index | High correlation |
incident_diabetes is highly correlated with diabetes_time | High correlation |
diabetes_time is highly correlated with incident_diabetes | High correlation |
SBP is highly correlated with DBP | High correlation |
DBP is highly correlated with SBP | High correlation |
fasting_glucose is highly correlated with HbA1c | High correlation |
HbA1c is highly correlated with fasting_glucose | High correlation |
healthy_vegetables is highly correlated with total_fiber | High correlation |
total_fiber is highly correlated with healthy_vegetables | High correlation |
df_index is highly correlated with SampleID | High correlation |
SampleID is highly correlated with df_index | High correlation |
age is highly correlated with junk_food | High correlation |
BMI is highly correlated with fasting_insulin | High correlation |
SBP is highly correlated with DBP | High correlation |
DBP is highly correlated with SBP | High correlation |
fasting_insulin is highly correlated with BMI | High correlation |
healthy_vegetables is highly correlated with total_fiber | High correlation |
junk_food is highly correlated with age | High correlation |
total_fiber is highly correlated with healthy_vegetables | High correlation |
df_index is highly correlated with SampleID and 4 other fields | High correlation |
SampleID is highly correlated with df_index and 4 other fields | High correlation |
incident_diabetes is highly correlated with df_index and 4 other fields | High correlation |
diabetes_time is highly correlated with male and 3 other fields | High correlation |
age is highly correlated with current_smoker and 1 other fields | High correlation |
male is highly correlated with diabetes_time and 3 other fields | High correlation |
BMI is highly correlated with current_smoker and 1 other fields | High correlation |
HDL is highly correlated with hypertension and 2 other fields | High correlation |
LDL is highly correlated with hypertension and 2 other fields | High correlation |
trig is highly correlated with hypertension and 2 other fields | High correlation |
SBP is highly correlated with current_smoker and 1 other fields | High correlation |
DBP is highly correlated with current_smoker and 1 other fields | High correlation |
hypertension is highly correlated with df_index and 9 other fields | High correlation |
fasting is highly correlated with current_smoker and 1 other fields | High correlation |
fasting_glucose is highly correlated with current_smoker and 1 other fields | High correlation |
fasting_insulin is highly correlated with current_smoker and 2 other fields | High correlation |
HbA1c is highly correlated with current_smoker and 1 other fields | High correlation |
current_smoker is highly correlated with df_index and 17 other fields | High correlation |
ex_smoker is highly correlated with df_index and 17 other fields | High correlation |
exercise is highly correlated with fasting_insulin | High correlation |
healthy_vegetables is highly correlated with total_fiber | High correlation |
total_fiber is highly correlated with healthy_vegetables | High correlation |
incident_diabetes is highly correlated with diabetes_time | High correlation |
DBP is highly correlated with SBP | High correlation |
total_fiber is highly correlated with healthy_vegetables | High correlation |
diabetes_time is highly correlated with incident_diabetes and 1 other fields | High correlation |
df_index is highly correlated with SampleID | High correlation |
SBP is highly correlated with DBP and 1 other fields | High correlation |
HbA1c is highly correlated with fasting_glucose | High correlation |
SampleID is highly correlated with df_index | High correlation |
healthy_vegetables is highly correlated with total_fiber | High correlation |
hypertension is highly correlated with SBP | High correlation |
fasting_glucose is highly correlated with diabetes_time and 1 other fields | High correlation |
fasting_glucose has 4421 (57.4%) missing values | Missing |
fasting_insulin has 4421 (57.4%) missing values | Missing |
HbA1c has 3348 (43.5%) missing values | Missing |
exercise has 112 (1.5%) missing values | Missing |
healthy_vegetables has 168 (2.2%) missing values | Missing |
junk_food has 136 (1.8%) missing values | Missing |
total_fiber has 419 (5.4%) missing values | Missing |
df_index is uniformly distributed | Uniform |
SampleID is uniformly distributed | Uniform |
df_index has unique values | Unique |
SampleID has unique values | Unique |
Reproduction
| Analysis started | 2021-08-13 20:10:22.380968 |
|---|---|
| Analysis finished | 2021-08-13 20:11:06.211420 |
| Duration | 43.83 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
df_index
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 7697 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4148.146421 |
| Minimum | 1 |
|---|---|
| Maximum | 8290 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 60.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 423.8 |
| Q1 | 2076 |
| median | 4147 |
| Q3 | 6217 |
| 95-th percentile | 7878.2 |
| Maximum | 8290 |
| Range | 8289 |
| Interquartile range (IQR) | 4141 |
Descriptive statistics
| Standard deviation | 2390.894749 |
|---|---|
| Coefficient of variation (CV) | 0.5763766527 |
| Kurtosis | -1.196031006 |
| Mean | 4148.146421 |
| Median Absolute Deviation (MAD) | 2071 |
| Skewness | 0.001183643904 |
| Sum | 31928283 |
| Variance | 5716377.7 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 5431 | 1 | < 0.1% |
| 5527 | 1 | < 0.1% |
| 5526 | 1 | < 0.1% |
| 5525 | 1 | < 0.1% |
| 5524 | 1 | < 0.1% |
| 5522 | 1 | < 0.1% |
| 5521 | 1 | < 0.1% |
| 5520 | 1 | < 0.1% |
| 5519 | 1 | < 0.1% |
| Other values (7687) | 7687 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 8290 | 1 | |
| 8289 | 1 | |
| 8288 | 1 | |
| 8287 | 1 | |
| 8286 | 1 | |
| 8285 | 1 | |
| 8284 | 1 | |
| 8283 | 1 | |
| 8282 | 1 | |
| 8281 | 1 |
SampleID
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 7697 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4149.146421 |
| Minimum | 2 |
|---|---|
| Maximum | 8291 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 424.8 |
| Q1 | 2077 |
| median | 4148 |
| Q3 | 6218 |
| 95-th percentile | 7879.2 |
| Maximum | 8291 |
| Range | 8289 |
| Interquartile range (IQR) | 4141 |
Descriptive statistics
| Standard deviation | 2390.894749 |
|---|---|
| Coefficient of variation (CV) | 0.5762377382 |
| Kurtosis | -1.196031006 |
| Mean | 4149.146421 |
| Median Absolute Deviation (MAD) | 2071 |
| Skewness | 0.001183643904 |
| Sum | 31935980 |
| Variance | 5716377.7 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 5432 | 1 | < 0.1% |
| 5528 | 1 | < 0.1% |
| 5527 | 1 | < 0.1% |
| 5526 | 1 | < 0.1% |
| 5525 | 1 | < 0.1% |
| 5523 | 1 | < 0.1% |
| 5522 | 1 | < 0.1% |
| 5521 | 1 | < 0.1% |
| 5520 | 1 | < 0.1% |
| Other values (7687) | 7687 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 |
| Value | Count | Frequency (%) |
| 8291 | 1 | |
| 8290 | 1 | |
| 8289 | 1 | |
| 8288 | 1 | |
| 8287 | 1 | |
| 8286 | 1 | |
| 8285 | 1 | |
| 8284 | 1 | |
| 8283 | 1 | |
| 8282 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 436.1 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7697 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 6993 | |
| 1 | 704 | 9.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 6993 | |
| 1 | 704 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6993 | |
| 1 | 704 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7697 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6993 | |
| 1 | 704 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7697 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6993 | |
| 1 | 704 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7697 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6993 | |
| 1 | 704 | 9.1% |
| Distinct | 866 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.73929214 |
| Minimum | 0.02999999933 |
|---|---|
| Maximum | 14.96000004 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 0.02999999933 |
|---|---|
| 5-th percentile | 6.079999924 |
| Q1 | 14.76000023 |
| median | 14.81999969 |
| Q3 | 14.88000011 |
| 95-th percentile | 14.93000031 |
| Maximum | 14.96000004 |
| Range | 14.93000004 |
| Interquartile range (IQR) | 0.1199998856 |
Descriptive statistics
| Standard deviation | 2.935263634 |
|---|---|
| Coefficient of variation (CV) | 0.2136400938 |
| Kurtosis | 7.112100601 |
| Mean | 13.73929214 |
| Median Absolute Deviation (MAD) | 0.06000041962 |
| Skewness | -2.81443119 |
| Sum | 105751.3281 |
| Variance | 8.615773201 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14.81999969 | 515 | 6.7% |
| 14.88000011 | 492 | 6.4% |
| 14.80000019 | 476 | 6.2% |
| 14.84000015 | 450 | 5.8% |
| 14.92000008 | 409 | 5.3% |
| 14.89999962 | 391 | 5.1% |
| 14.85999966 | 361 | 4.7% |
| 14.93999958 | 281 | 3.7% |
| 14.93000031 | 272 | 3.5% |
| 14.77999973 | 262 | 3.4% |
| Other values (856) | 3788 |
| Value | Count | Frequency (%) |
| 0.02999999933 | 1 | |
| 0.07999999821 | 1 | |
| 0.1400000006 | 1 | |
| 0.150000006 | 2 | |
| 0.1700000018 | 1 | |
| 0.2099999934 | 1 | |
| 0.25 | 1 | |
| 0.3300000131 | 2 | |
| 0.400000006 | 2 | |
| 0.4199999869 | 2 |
| Value | Count | Frequency (%) |
| 14.96000004 | 2 | < 0.1% |
| 14.93999958 | 281 | |
| 14.93000031 | 272 | |
| 14.92000008 | 409 | |
| 14.90999985 | 262 | |
| 14.89999962 | 391 | |
| 14.89000034 | 223 | |
| 14.88000011 | 492 | |
| 14.86999989 | 251 | |
| 14.85999966 | 361 |
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.59828505 |
| Minimum | 24 |
|---|---|
| Maximum | 74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 24 |
|---|---|
| 5-th percentile | 27 |
| Q1 | 37 |
| median | 48 |
| Q3 | 58 |
| 95-th percentile | 69 |
| Maximum | 74 |
| Range | 50 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 13.13774328 |
|---|---|
| Coefficient of variation (CV) | 0.2760129544 |
| Kurtosis | -1.008195018 |
| Mean | 47.59828505 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.02104560191 |
| Sum | 366364 |
| Variance | 172.6002985 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 56 | 256 | 3.3% |
| 55 | 243 | 3.2% |
| 54 | 203 | 2.6% |
| 43 | 195 | 2.5% |
| 42 | 193 | 2.5% |
| 39 | 188 | 2.4% |
| 52 | 187 | 2.4% |
| 37 | 187 | 2.4% |
| 50 | 186 | 2.4% |
| 44 | 182 | 2.4% |
| Other values (41) | 5677 |
| Value | Count | Frequency (%) |
| 24 | 70 | |
| 25 | 142 | |
| 26 | 167 | |
| 27 | 139 | |
| 28 | 159 | |
| 29 | 131 | |
| 30 | 149 | |
| 31 | 146 | |
| 32 | 157 | |
| 33 | 165 |
| Value | Count | Frequency (%) |
| 74 | 38 | 0.5% |
| 73 | 73 | |
| 72 | 91 | |
| 71 | 78 | |
| 70 | 70 | |
| 69 | 91 | |
| 68 | 100 | |
| 67 | 110 | |
| 66 | 101 | |
| 65 | 85 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 436.1 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7697 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 4092 | |
| 1 | 3605 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 4092 | |
| 1 | 3605 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4092 | |
| 1 | 3605 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7697 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4092 | |
| 1 | 3605 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7697 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4092 | |
| 1 | 3605 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7697 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4092 | |
| 1 | 3605 |
| Distinct | 7548 |
|---|---|
| Distinct (%) | 98.1% |
| Missing | 4 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.72398376 |
| Minimum | 15.83675671 |
|---|---|
| Maximum | 55.97962952 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 15.83675671 |
|---|---|
| 5-th percentile | 20.50849838 |
| Q1 | 23.5651207 |
| median | 26.17281914 |
| Q3 | 29.18943024 |
| 95-th percentile | 35.03690567 |
| Maximum | 55.97962952 |
| Range | 40.14287281 |
| Interquartile range (IQR) | 5.62430954 |
Descriptive statistics
| Standard deviation | 4.526644707 |
|---|---|
| Coefficient of variation (CV) | 0.1693851054 |
| Kurtosis | 1.69921124 |
| Mean | 26.72398376 |
| Median Absolute Deviation (MAD) | 2.766759872 |
| Skewness | 0.9362901449 |
| Sum | 205587.6094 |
| Variance | 20.49051285 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24.03461075 | 3 | < 0.1% |
| 21.51691055 | 3 | < 0.1% |
| 28.3599205 | 2 | < 0.1% |
| 20.75844002 | 2 | < 0.1% |
| 24.48605919 | 2 | < 0.1% |
| 24.46030045 | 2 | < 0.1% |
| 22.68431091 | 2 | < 0.1% |
| 21.77596092 | 2 | < 0.1% |
| 28.70982933 | 2 | < 0.1% |
| 24.14126968 | 2 | < 0.1% |
| Other values (7538) | 7671 | |
| (Missing) | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 15.83675671 | 1 | |
| 16.37446404 | 1 | |
| 16.78290749 | 1 | |
| 16.81723022 | 1 | |
| 16.89015007 | 1 | |
| 16.97460938 | 1 | |
| 16.97692108 | 1 | |
| 17.01846886 | 1 | |
| 17.15185928 | 1 | |
| 17.23607063 | 1 |
| Value | Count | Frequency (%) |
| 55.97962952 | 1 | |
| 53.3474617 | 1 | |
| 52.01060104 | 1 | |
| 51.04481125 | 1 | |
| 48.86034393 | 1 | |
| 48.5961113 | 1 | |
| 48.42119217 | 1 | |
| 48.37353897 | 1 | |
| 48.09280014 | 1 | |
| 46.77568817 | 1 |
| Distinct | 261 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.510970473 |
| Minimum | 0.3100000024 |
|---|---|
| Maximum | 4.199999809 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 0.3100000024 |
|---|---|
| 5-th percentile | 0.9399999976 |
| Q1 | 1.210000038 |
| median | 1.460000038 |
| Q3 | 1.75 |
| 95-th percentile | 2.289999962 |
| Maximum | 4.199999809 |
| Range | 3.889999807 |
| Interquartile range (IQR) | 0.5399999619 |
Descriptive statistics
| Standard deviation | 0.4159130454 |
|---|---|
| Coefficient of variation (CV) | 0.2752622068 |
| Kurtosis | 1.220328689 |
| Mean | 1.510970473 |
| Median Absolute Deviation (MAD) | 0.2599999905 |
| Skewness | 0.830974102 |
| Sum | 11629.93945 |
| Variance | 0.1729836613 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.299999952 | 95 | 1.2% |
| 1.49000001 | 91 | 1.2% |
| 1.409999967 | 88 | 1.1% |
| 1.399999976 | 85 | 1.1% |
| 1.549999952 | 85 | 1.1% |
| 1.190000057 | 84 | 1.1% |
| 1.179999948 | 84 | 1.1% |
| 1.24000001 | 84 | 1.1% |
| 1.370000005 | 83 | 1.1% |
| 1.379999995 | 83 | 1.1% |
| Other values (251) | 6835 |
| Value | Count | Frequency (%) |
| 0.3100000024 | 1 | |
| 0.4699999988 | 1 | |
| 0.5 | 1 | |
| 0.5199999809 | 2 | |
| 0.5299999714 | 1 | |
| 0.5899999738 | 2 | |
| 0.6000000238 | 2 | |
| 0.6200000048 | 1 | |
| 0.6399999857 | 2 | |
| 0.6499999762 | 2 |
| Value | Count | Frequency (%) |
| 4.199999809 | 1 | |
| 3.769999981 | 1 | |
| 3.75999999 | 1 | |
| 3.720000029 | 1 | |
| 3.609999895 | 1 | |
| 3.5 | 1 | |
| 3.420000076 | 1 | |
| 3.410000086 | 1 | |
| 3.160000086 | 2 | |
| 3.150000095 | 1 |
| Distinct | 513 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 14 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.356531382 |
| Minimum | 0.7699999809 |
|---|---|
| Maximum | 9.550000191 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 0.7699999809 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2.720000029 |
| median | 3.299999952 |
| Q3 | 3.910000086 |
| 95-th percentile | 4.929999828 |
| Maximum | 9.550000191 |
| Range | 8.78000021 |
| Interquartile range (IQR) | 1.190000057 |
Descriptive statistics
| Standard deviation | 0.9024052024 |
|---|---|
| Coefficient of variation (CV) | 0.2688505054 |
| Kurtosis | 0.8598174453 |
| Mean | 3.356531382 |
| Median Absolute Deviation (MAD) | 0.5900001526 |
| Skewness | 0.5335271358 |
| Sum | 25788.23047 |
| Variance | 0.8143351078 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.799999952 | 49 | 0.6% |
| 3.589999914 | 47 | 0.6% |
| 3.089999914 | 45 | 0.6% |
| 2.900000095 | 43 | 0.6% |
| 3.339999914 | 43 | 0.6% |
| 3.24000001 | 43 | 0.6% |
| 3.400000095 | 42 | 0.5% |
| 2.690000057 | 42 | 0.5% |
| 3.430000067 | 42 | 0.5% |
| 3.299999952 | 42 | 0.5% |
| Other values (503) | 7245 |
| Value | Count | Frequency (%) |
| 0.7699999809 | 1 | |
| 0.8000000119 | 1 | |
| 0.8399999738 | 1 | |
| 0.8600000143 | 1 | |
| 1 | 1 | |
| 1.00999999 | 1 | |
| 1.029999971 | 1 | |
| 1.039999962 | 1 | |
| 1.059999943 | 1 | |
| 1.080000043 | 1 |
| Value | Count | Frequency (%) |
| 9.550000191 | 1 | |
| 8.93999958 | 1 | |
| 7.840000153 | 1 | |
| 7.760000229 | 1 | |
| 7.440000057 | 1 | |
| 7.400000095 | 1 | |
| 7.139999866 | 2 | |
| 6.940000057 | 1 | |
| 6.889999866 | 1 | |
| 6.829999924 | 1 |
| Distinct | 469 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.407413244 |
| Minimum | 0.2700000107 |
|---|---|
| Maximum | 18.45999908 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 0.2700000107 |
|---|---|
| 5-th percentile | 0.5899999738 |
| Q1 | 0.8399999738 |
| median | 1.159999967 |
| Q3 | 1.659999967 |
| 95-th percentile | 3.019999981 |
| Maximum | 18.45999908 |
| Range | 18.18999907 |
| Interquartile range (IQR) | 0.8199999928 |
Descriptive statistics
| Standard deviation | 0.9503949881 |
|---|---|
| Coefficient of variation (CV) | 0.6752778292 |
| Kurtosis | 44.55989075 |
| Mean | 1.407413244 |
| Median Absolute Deviation (MAD) | 0.3699999452 |
| Skewness | 4.537890911 |
| Sum | 10832.85938 |
| Variance | 0.9032506347 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.8299999833 | 85 | 1.1% |
| 0.8799999952 | 76 | 1.0% |
| 1.00999999 | 75 | 1.0% |
| 0.8199999928 | 73 | 0.9% |
| 0.7599999905 | 73 | 0.9% |
| 0.9800000191 | 72 | 0.9% |
| 0.8100000024 | 71 | 0.9% |
| 0.9200000167 | 70 | 0.9% |
| 0.7900000215 | 70 | 0.9% |
| 0.8000000119 | 69 | 0.9% |
| Other values (459) | 6963 |
| Value | Count | Frequency (%) |
| 0.2700000107 | 2 | < 0.1% |
| 0.3000000119 | 1 | < 0.1% |
| 0.3100000024 | 1 | < 0.1% |
| 0.3199999928 | 1 | < 0.1% |
| 0.3300000131 | 1 | < 0.1% |
| 0.3400000036 | 1 | < 0.1% |
| 0.3600000143 | 5 | |
| 0.3700000048 | 2 | < 0.1% |
| 0.3799999952 | 5 | |
| 0.3899999857 | 8 |
| Value | Count | Frequency (%) |
| 18.45999908 | 1 | |
| 17.23999977 | 1 | |
| 14.15999985 | 1 | |
| 12.75 | 1 | |
| 11.73999977 | 1 | |
| 11 | 1 | |
| 10.67000008 | 1 | |
| 10.5 | 1 | |
| 10.21000004 | 1 | |
| 9.979999542 | 1 |
| Distinct | 134 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 134.715683 |
| Minimum | 89 |
|---|---|
| Maximum | 228 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 89 |
|---|---|
| 5-th percentile | 108 |
| Q1 | 120 |
| median | 132 |
| Q3 | 146 |
| 95-th percentile | 171 |
| Maximum | 228 |
| Range | 139 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 19.81209183 |
|---|---|
| Coefficient of variation (CV) | 0.1470659673 |
| Kurtosis | 0.7745363712 |
| Mean | 134.715683 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.8007217646 |
| Sum | 1036502.5 |
| Variance | 392.5190125 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 124 | 184 | 2.4% |
| 127 | 181 | 2.4% |
| 126 | 177 | 2.3% |
| 121 | 177 | 2.3% |
| 129 | 174 | 2.3% |
| 128 | 174 | 2.3% |
| 125 | 174 | 2.3% |
| 130 | 174 | 2.3% |
| 123 | 173 | 2.2% |
| 131 | 172 | 2.2% |
| Other values (124) | 5934 |
| Value | Count | Frequency (%) |
| 89 | 1 | < 0.1% |
| 91 | 2 | < 0.1% |
| 92 | 2 | < 0.1% |
| 93 | 2 | < 0.1% |
| 94 | 3 | < 0.1% |
| 95 | 4 | 0.1% |
| 96 | 11 | |
| 97 | 8 | |
| 98 | 11 | |
| 99 | 15 |
| Value | Count | Frequency (%) |
| 228 | 1 | < 0.1% |
| 227 | 2 | |
| 226 | 1 | < 0.1% |
| 219 | 1 | < 0.1% |
| 218 | 1 | < 0.1% |
| 217 | 1 | < 0.1% |
| 215 | 1 | < 0.1% |
| 214 | 3 | |
| 213 | 1 | < 0.1% |
| 212 | 1 | < 0.1% |
| Distinct | 83 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.01624298 |
| Minimum | 39 |
|---|---|
| Maximum | 126 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 39 |
|---|---|
| 5-th percentile | 61 |
| Q1 | 71 |
| median | 79 |
| Q3 | 87 |
| 95-th percentile | 98 |
| Maximum | 126 |
| Range | 87 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 11.33845806 |
|---|---|
| Coefficient of variation (CV) | 0.1434952766 |
| Kurtosis | 0.1288583875 |
| Mean | 79.01624298 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.1495639384 |
| Sum | 607951 |
| Variance | 128.5606384 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 77 | 305 | 4.0% |
| 79 | 293 | 3.8% |
| 83 | 268 | 3.5% |
| 81 | 265 | 3.4% |
| 80 | 263 | 3.4% |
| 78 | 263 | 3.4% |
| 74 | 260 | 3.4% |
| 75 | 256 | 3.3% |
| 85 | 247 | 3.2% |
| 76 | 245 | 3.2% |
| Other values (73) | 5029 |
| Value | Count | Frequency (%) |
| 39 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 45 | 2 | < 0.1% |
| 46 | 5 | |
| 47 | 1 | < 0.1% |
| 48 | 4 | |
| 49 | 8 | |
| 50 | 5 |
| Value | Count | Frequency (%) |
| 126 | 1 | < 0.1% |
| 124 | 2 | |
| 123 | 1 | < 0.1% |
| 122 | 1 | < 0.1% |
| 120 | 2 | |
| 119 | 1 | < 0.1% |
| 117 | 3 | |
| 116 | 4 | |
| 115 | 4 | |
| 114 | 4 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 451.1 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23091 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 5837 | |
| 1.0 | 1860 | 24.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 5837 | |
| 1.0 | 1860 | 24.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13534 | |
| . | 7697 | |
| 1 | 1860 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15394 | |
| Other Punctuation | 7697 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 13534 | |
| 1 | 1860 | 12.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7697 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23091 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 13534 | |
| . | 7697 | |
| 1 | 1860 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23091 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 13534 | |
| . | 7697 | |
| 1 | 1860 | 8.1% |
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 20 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.845252037 |
| Minimum | 0 |
|---|---|
| Maximum | 31 |
| Zeros | 4 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 14 |
| Maximum | 31 |
| Range | 31 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.108305454 |
|---|---|
| Coefficient of variation (CV) | 0.5317658782 |
| Kurtosis | 6.96020937 |
| Mean | 5.845252037 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.49319315 |
| Sum | 44874 |
| Variance | 9.66156292 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 2756 | |
| 4 | 1709 | |
| 6 | 1365 | |
| 7 | 450 | 5.8% |
| 3 | 238 | 3.1% |
| 2 | 189 | 2.5% |
| 8 | 160 | 2.1% |
| 15 | 112 | 1.5% |
| 14 | 110 | 1.4% |
| 16 | 99 | 1.3% |
| Other values (17) | 489 | 6.4% |
| Value | Count | Frequency (%) |
| 0 | 4 | 0.1% |
| 1 | 71 | 0.9% |
| 2 | 189 | 2.5% |
| 3 | 238 | 3.1% |
| 4 | 1709 | |
| 5 | 2756 | |
| 6 | 1365 | |
| 7 | 450 | 5.8% |
| 8 | 160 | 2.1% |
| 9 | 66 | 0.9% |
| Value | Count | Frequency (%) |
| 31 | 1 | < 0.1% |
| 28 | 2 | < 0.1% |
| 24 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 3 | < 0.1% |
| 21 | 3 | < 0.1% |
| 20 | 14 | 0.2% |
| 19 | 13 | 0.2% |
| 18 | 48 | |
| 17 | 59 |
| Distinct | 362 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 4421 |
| Missing (%) | 57.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.829258442 |
| Minimum | 4.159999847 |
|---|---|
| Maximum | 18.81999969 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 4.159999847 |
|---|---|
| 5-th percentile | 4.900000095 |
| Q1 | 5.360000134 |
| median | 5.71999979 |
| Q3 | 6.130000114 |
| 95-th percentile | 6.960000038 |
| Maximum | 18.81999969 |
| Range | 14.65999985 |
| Interquartile range (IQR) | 0.7699999809 |
Descriptive statistics
| Standard deviation | 0.8572081923 |
|---|---|
| Coefficient of variation (CV) | 0.1470527053 |
| Kurtosis | 58.63638687 |
| Mean | 5.829258442 |
| Median Absolute Deviation (MAD) | 0.3799996376 |
| Skewness | 5.31879282 |
| Sum | 19096.65039 |
| Variance | 0.734805882 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.739999771 | 35 | 0.5% |
| 5.380000114 | 34 | 0.4% |
| 5.480000019 | 33 | 0.4% |
| 5.619999886 | 29 | 0.4% |
| 5.71999979 | 29 | 0.4% |
| 5.820000172 | 29 | 0.4% |
| 5.53000021 | 29 | 0.4% |
| 5.630000114 | 28 | 0.4% |
| 5.590000153 | 28 | 0.4% |
| 5.610000134 | 27 | 0.4% |
| Other values (352) | 2975 | |
| (Missing) | 4421 |
| Value | Count | Frequency (%) |
| 4.159999847 | 1 | < 0.1% |
| 4.260000229 | 1 | < 0.1% |
| 4.269999981 | 1 | < 0.1% |
| 4.320000172 | 1 | < 0.1% |
| 4.329999924 | 1 | < 0.1% |
| 4.389999866 | 1 | < 0.1% |
| 4.400000095 | 1 | < 0.1% |
| 4.409999847 | 1 | < 0.1% |
| 4.440000057 | 3 | |
| 4.449999809 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 18.81999969 | 1 | |
| 17.13999939 | 1 | |
| 16.87000084 | 1 | |
| 16.70999908 | 1 | |
| 16.26000023 | 1 | |
| 14.64000034 | 1 | |
| 13.86999989 | 1 | |
| 13.42000008 | 1 | |
| 12.93999958 | 1 | |
| 12.71000004 | 1 |
| Distinct | 288 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 4421 |
| Missing (%) | 57.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.934035301 |
| Minimum | 0.8000000119 |
|---|---|
| Maximum | 117.0999985 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 0.8000000119 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 5.099999905 |
| median | 7.300000191 |
| Q3 | 10.80000019 |
| 95-th percentile | 19.79999924 |
| Maximum | 117.0999985 |
| Range | 116.2999985 |
| Interquartile range (IQR) | 5.700000286 |
Descriptive statistics
| Standard deviation | 6.581742764 |
|---|---|
| Coefficient of variation (CV) | 0.7367043495 |
| Kurtosis | 42.87613678 |
| Mean | 8.934035301 |
| Median Absolute Deviation (MAD) | 2.600000381 |
| Skewness | 4.44527483 |
| Sum | 29267.90039 |
| Variance | 43.31933594 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.199999809 | 48 | 0.6% |
| 5.300000191 | 46 | 0.6% |
| 6.900000095 | 45 | 0.6% |
| 5.099999905 | 44 | 0.6% |
| 6 | 43 | 0.6% |
| 5.800000191 | 41 | 0.5% |
| 4.5 | 41 | 0.5% |
| 7 | 40 | 0.5% |
| 5 | 39 | 0.5% |
| 7.5 | 39 | 0.5% |
| Other values (278) | 2850 | |
| (Missing) | 4421 |
| Value | Count | Frequency (%) |
| 0.8000000119 | 2 | < 0.1% |
| 1 | 1 | < 0.1% |
| 1.200000048 | 2 | < 0.1% |
| 1.299999952 | 2 | < 0.1% |
| 1.399999976 | 1 | < 0.1% |
| 1.5 | 4 | 0.1% |
| 1.600000024 | 7 | |
| 1.700000048 | 5 | |
| 1.799999952 | 6 | |
| 1.899999976 | 10 |
| Value | Count | Frequency (%) |
| 117.0999985 | 1 | |
| 91.69999695 | 1 | |
| 75.19999695 | 1 | |
| 71.69999695 | 1 | |
| 67 | 1 | |
| 56.20000076 | 1 | |
| 55.90000153 | 1 | |
| 55 | 1 | |
| 53 | 1 | |
| 52.79999924 | 1 |
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 3348 |
| Missing (%) | 43.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.00069046 |
| Minimum | 20 |
|---|---|
| Maximum | 142 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 33 |
| median | 36 |
| Q3 | 38 |
| 95-th percentile | 42 |
| Maximum | 142 |
| Range | 122 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.656335831 |
|---|---|
| Coefficient of variation (CV) | 0.1293401867 |
| Kurtosis | 86.22382355 |
| Mean | 36.00069046 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 4.902283192 |
| Sum | 156567 |
| Variance | 21.68146324 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36 | 565 | 7.3% |
| 34 | 550 | 7.1% |
| 37 | 538 | 7.0% |
| 38 | 451 | 5.9% |
| 33 | 442 | 5.7% |
| 39 | 319 | 4.1% |
| 32 | 312 | 4.1% |
| 40 | 253 | 3.3% |
| 31 | 213 | 2.8% |
| 41 | 139 | 1.8% |
| Other values (33) | 567 | 7.4% |
| (Missing) | 3348 |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 25 | 4 | 0.1% |
| 26 | 19 | 0.2% |
| 27 | 35 | 0.5% |
| 28 | 42 | 0.5% |
| 29 | 68 | 0.9% |
| 30 | 114 | |
| 31 | 213 |
| Value | Count | Frequency (%) |
| 142 | 1 | |
| 105 | 1 | |
| 103 | 1 | |
| 79 | 1 | |
| 74 | 1 | |
| 70 | 1 | |
| 64 | 1 | |
| 62 | 1 | |
| 58 | 1 | |
| 57 | 2 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 38 |
| Missing (%) | 0.5% |
| Memory size | 450.4 KiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 22977 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 5647 | |
| 1.0 | 2012 | 26.1% |
| (Missing) | 38 | 0.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 5647 | |
| 1.0 | 2012 | 26.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13306 | |
| . | 7659 | |
| 1 | 2012 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15318 | |
| Other Punctuation | 7659 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 13306 | |
| 1 | 2012 | 13.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7659 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22977 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 13306 | |
| . | 7659 | |
| 1 | 2012 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22977 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 13306 | |
| . | 7659 | |
| 1 | 2012 | 8.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 436.1 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7697 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 6058 | |
| 1 | 1639 | 21.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 6058 | |
| 1 | 1639 | 21.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6058 | |
| 1 | 1639 | 21.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7697 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6058 | |
| 1 | 1639 | 21.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7697 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6058 | |
| 1 | 1639 | 21.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7697 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6058 | |
| 1 | 1639 | 21.3% |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 112 |
| Missing (%) | 1.5% |
| Memory size | 448.9 KiB |
| 2.0 | |
|---|---|
| 3.0 | |
| 1.0 | |
| 4.0 | 91 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 22755 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 1.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 4100 | |
| 3.0 | 1702 | |
| 1.0 | 1692 | |
| 4.0 | 91 | 1.2% |
| (Missing) | 112 | 1.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2.0 | 4100 | |
| 3.0 | 1702 | |
| 1.0 | 1692 | |
| 4.0 | 91 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 7585 | |
| 0 | 7585 | |
| 2 | 4100 | |
| 3 | 1702 | 7.5% |
| 1 | 1692 | 7.4% |
| 4 | 91 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15170 | |
| Other Punctuation | 7585 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7585 | |
| 2 | 4100 | |
| 3 | 1702 | 11.2% |
| 1 | 1692 | 11.2% |
| 4 | 91 | 0.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7585 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22755 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 7585 | |
| 0 | 7585 | |
| 2 | 4100 | |
| 3 | 1702 | 7.5% |
| 1 | 1692 | 7.4% |
| 4 | 91 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22755 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 7585 | |
| 0 | 7585 | |
| 2 | 4100 | |
| 3 | 1702 | 7.5% |
| 1 | 1692 | 7.4% |
| 4 | 91 | 0.4% |
healthy_vegetables
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 168 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.72174263 |
| Minimum | 3 |
|---|---|
| Maximum | 18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 11 |
| Q3 | 13 |
| 95-th percentile | 16 |
| Maximum | 18 |
| Range | 15 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.412720203 |
|---|---|
| Coefficient of variation (CV) | 0.3182990253 |
| Kurtosis | -0.5452287793 |
| Mean | 10.72174263 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.119201526 |
| Sum | 80724 |
| Variance | 11.64665985 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 835 | |
| 10 | 808 | |
| 12 | 798 | |
| 13 | 744 | |
| 9 | 688 | |
| 8 | 597 | |
| 14 | 596 | |
| 15 | 478 | |
| 7 | 450 | 5.8% |
| 6 | 369 | 4.8% |
| Other values (6) | 1166 |
| Value | Count | Frequency (%) |
| 3 | 118 | 1.5% |
| 4 | 192 | 2.5% |
| 5 | 261 | 3.4% |
| 6 | 369 | |
| 7 | 450 | |
| 8 | 597 | |
| 9 | 688 | |
| 10 | 808 | |
| 11 | 835 | |
| 12 | 798 |
| Value | Count | Frequency (%) |
| 18 | 143 | 1.9% |
| 17 | 132 | 1.7% |
| 16 | 320 | 4.2% |
| 15 | 478 | |
| 14 | 596 | |
| 13 | 744 | |
| 12 | 798 | |
| 11 | 835 | |
| 10 | 808 | |
| 9 | 688 |
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 136 |
| Missing (%) | 1.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.333156586 |
| Minimum | 5 |
|---|---|
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 6 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 14 |
| Maximum | 24 |
| Range | 19 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.023710728 |
|---|---|
| Coefficient of variation (CV) | 0.3628529906 |
| Kurtosis | 1.059070587 |
| Mean | 8.333156586 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.053599954 |
| Sum | 63007 |
| Variance | 9.142827034 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 1461 | |
| 6 | 1227 | |
| 7 | 885 | |
| 8 | 865 | |
| 9 | 829 | |
| 10 | 657 | |
| 11 | 509 | 6.6% |
| 12 | 364 | 4.7% |
| 13 | 272 | 3.5% |
| 14 | 170 | 2.2% |
| Other values (10) | 322 | 4.2% |
| (Missing) | 136 | 1.8% |
| Value | Count | Frequency (%) |
| 5 | 1461 | |
| 6 | 1227 | |
| 7 | 885 | |
| 8 | 865 | |
| 9 | 829 | |
| 10 | 657 | |
| 11 | 509 | 6.6% |
| 12 | 364 | 4.7% |
| 13 | 272 | 3.5% |
| 14 | 170 | 2.2% |
| Value | Count | Frequency (%) |
| 24 | 3 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 4 | 0.1% |
| 21 | 2 | < 0.1% |
| 20 | 13 | 0.2% |
| 19 | 16 | 0.2% |
| 18 | 29 | 0.4% |
| 17 | 58 | |
| 16 | 74 | |
| 15 | 122 |
total_fiber
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 40 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 419 |
| Missing (%) | 5.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.862463 |
| Minimum | 9 |
|---|---|
| Maximum | 48 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 30.2 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 28 |
| median | 32 |
| Q3 | 36 |
| 95-th percentile | 41 |
| Maximum | 48 |
| Range | 39 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 6.202386856 |
|---|---|
| Coefficient of variation (CV) | 0.1946612448 |
| Kurtosis | -0.07558012009 |
| Mean | 31.862463 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.3200424314 |
| Sum | 231895 |
| Variance | 38.46960449 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 487 | 6.3% |
| 34 | 475 | 6.2% |
| 35 | 452 | 5.9% |
| 30 | 440 | 5.7% |
| 31 | 439 | 5.7% |
| 32 | 427 | 5.5% |
| 36 | 416 | 5.4% |
| 37 | 361 | 4.7% |
| 28 | 360 | 4.7% |
| 29 | 360 | 4.7% |
| Other values (30) | 3061 | |
| (Missing) | 419 | 5.4% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 10 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| 12 | 9 | 0.1% |
| 13 | 7 | 0.1% |
| 14 | 18 | |
| 15 | 20 | |
| 16 | 21 | |
| 17 | 42 | |
| 18 | 41 |
| Value | Count | Frequency (%) |
| 48 | 3 | < 0.1% |
| 47 | 11 | 0.1% |
| 46 | 23 | 0.3% |
| 45 | 38 | 0.5% |
| 44 | 66 | 0.9% |
| 43 | 86 | 1.1% |
| 42 | 132 | |
| 41 | 153 | |
| 40 | 211 | |
| 39 | 304 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | SampleID | incident_diabetes | diabetes_time | age | male | BMI | HDL | LDL | trig | SBP | DBP | hypertension | fasting | fasting_glucose | fasting_insulin | HbA1c | current_smoker | ex_smoker | exercise | healthy_vegetables | junk_food | total_fiber | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 0 | 14.82 | 69 | 0 | 43.784050 | 1.60 | 3.88 | 1.85 | 178.0 | 79.0 | 0.0 | 5.0 | 6.91 | 19.600000 | 37.0 | 0.0 | 0 | 3.0 | 15.0 | 7.0 | 41.0 |
| 1 | 2 | 3 | 0 | 14.82 | 72 | 1 | 23.035959 | 1.55 | 2.97 | 1.12 | 156.0 | 75.0 | 1.0 | 6.0 | NaN | NaN | 37.0 | 0.0 | 0 | 3.0 | 8.0 | 6.0 | 32.0 |
| 2 | 3 | 4 | 1 | 2.20 | 68 | 0 | 39.421661 | 1.20 | 2.80 | 2.33 | 154.0 | 80.0 | 0.0 | 4.0 | 8.83 | 33.400002 | NaN | 0.0 | 0 | 1.0 | 13.0 | NaN | 35.0 |
| 3 | 4 | 5 | 0 | 14.82 | 60 | 0 | 27.896681 | 1.70 | 2.98 | 1.29 | 121.0 | 77.0 | 0.0 | 6.0 | 5.86 | 8.800000 | 38.0 | 0.0 | 0 | 2.0 | 9.0 | 8.0 | 37.0 |
| 4 | 5 | 6 | 0 | 14.82 | 25 | 1 | 24.795719 | 1.25 | 3.10 | 0.84 | 139.0 | 78.0 | 0.0 | 15.0 | NaN | NaN | 27.0 | 0.0 | 1 | 2.0 | 11.0 | 6.0 | 34.0 |
| 5 | 6 | 7 | 0 | 14.82 | 59 | 0 | 26.264620 | 1.77 | 2.80 | 0.83 | 139.0 | 64.0 | 0.0 | 4.0 | 6.72 | 8.000000 | 34.0 | 0.0 | 0 | 2.0 | 16.0 | 10.0 | 45.0 |
| 6 | 7 | 8 | 1 | 6.12 | 59 | 0 | 38.279789 | 1.71 | 4.01 | 2.14 | 192.0 | 83.0 | 1.0 | 15.0 | NaN | NaN | 39.0 | 0.0 | 0 | 1.0 | 11.0 | 8.0 | 33.0 |
| 7 | 8 | 9 | 0 | 14.82 | 38 | 1 | 24.278509 | 1.28 | 2.63 | 1.32 | 118.0 | 63.0 | 0.0 | 4.0 | NaN | NaN | 33.0 | 0.0 | 0 | 3.0 | 9.0 | 6.0 | 28.0 |
| 8 | 9 | 10 | 0 | 14.82 | 58 | 0 | 24.270260 | 1.47 | 3.80 | 1.52 | 167.0 | 76.0 | 0.0 | 5.0 | NaN | NaN | 34.0 | 0.0 | 0 | 2.0 | 12.0 | 9.0 | 37.0 |
| 9 | 10 | 11 | 0 | 14.82 | 27 | 0 | 31.679911 | 1.35 | 3.33 | 0.97 | 121.0 | 65.0 | 0.0 | 4.0 | NaN | NaN | 32.0 | 0.0 | 0 | 2.0 | 10.0 | 13.0 | 35.0 |
Last rows
| df_index | SampleID | incident_diabetes | diabetes_time | age | male | BMI | HDL | LDL | trig | SBP | DBP | hypertension | fasting | fasting_glucose | fasting_insulin | HbA1c | current_smoker | ex_smoker | exercise | healthy_vegetables | junk_food | total_fiber | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7687 | 8281 | 8282 | 0 | 14.94 | 27 | 0 | 21.600662 | 2.06 | 2.39 | 1.21 | 123.0 | 70.0 | 0.0 | 5.0 | NaN | NaN | 34.0 | 0.0 | 0 | 3.0 | 13.0 | 10.0 | 32.0 |
| 7688 | 8282 | 8283 | 0 | 14.94 | 46 | 1 | 23.605682 | 0.95 | 3.26 | 3.23 | 118.0 | 82.0 | 0.0 | 6.0 | 5.69 | 12.0 | 37.0 | 0.0 | 0 | 2.0 | 10.0 | 15.0 | 33.0 |
| 7689 | 8283 | 8284 | 0 | 2.98 | 57 | 0 | 25.188099 | 1.45 | 2.80 | 1.41 | 157.0 | 87.0 | 0.0 | 5.0 | 6.25 | 6.0 | 40.0 | 0.0 | 1 | 3.0 | 14.0 | 5.0 | 39.0 |
| 7690 | 8284 | 8285 | 1 | 7.02 | 59 | 0 | 29.174709 | 1.21 | 3.24 | 2.53 | 138.0 | 82.0 | 0.0 | 6.0 | 6.37 | 10.3 | 42.0 | 0.0 | 0 | 2.0 | 16.0 | 5.0 | 44.0 |
| 7691 | 8285 | 8286 | 1 | 6.06 | 64 | 1 | 23.524204 | 1.24 | 2.69 | 1.37 | 168.0 | 73.0 | 1.0 | 5.0 | 6.80 | 11.0 | 46.0 | 0.0 | 1 | 2.0 | 6.0 | 5.0 | 21.0 |
| 7692 | 8286 | 8287 | 0 | 14.89 | 35 | 1 | 21.626297 | 1.70 | 3.02 | 1.01 | 129.0 | 72.0 | 0.0 | 4.0 | NaN | NaN | 33.0 | 0.0 | 0 | 4.0 | 8.0 | 13.0 | 27.0 |
| 7693 | 8287 | 8288 | 0 | 10.54 | 69 | 1 | 23.877653 | 1.53 | 4.87 | 1.44 | 125.0 | 70.0 | 0.0 | 7.0 | 6.10 | 4.8 | 37.0 | 0.0 | 0 | 2.0 | 11.0 | 9.0 | 33.0 |
| 7694 | 8288 | 8289 | 0 | 14.89 | 44 | 1 | 26.055933 | 1.53 | 3.61 | 0.93 | 131.0 | 75.0 | 1.0 | 4.0 | NaN | NaN | 36.0 | 0.0 | 0 | 3.0 | 7.0 | 11.0 | 27.0 |
| 7695 | 8289 | 8290 | 0 | 14.89 | 30 | 0 | 25.044785 | 1.63 | 2.11 | 0.85 | 123.0 | 71.0 | 0.0 | 5.0 | NaN | NaN | 37.0 | 0.0 | 0 | 3.0 | 10.0 | 8.0 | 30.0 |
| 7696 | 8290 | 8291 | 0 | 14.89 | 27 | 0 | 21.744848 | 1.20 | 2.44 | 1.61 | 134.0 | 64.0 | 0.0 | 2.0 | NaN | NaN | 32.0 | 1.0 | 0 | 1.0 | 11.0 | 7.0 | 28.0 |